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Abstract. An approach towards a statistical survey of four dimensional supersymmetric vacua in the string 
theory landscape is described and illustrated with three examples of ensembles of intersecting D-brane 
models. The question whether it is conceivable to make predictions based on statistical distributions is 
discussed. Especially interesting in this context are possible correlations between low energy observables. 
As an example we look at correlations between properties of the gauge sector of intersecting D-brane 
models and Gepner model constructions. 
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' 1 Introduction 

■ As has been known for quite a while but only more widely 
I discussed recently [1], string theory provides us with an 

■ extremely large number of effective four dimensional the- 
' ories. The main reason for this lies in the abundance of 
. different ways how to compactify the theory from ten to 

■ four dimensions. This procedure is by no means unique 
^ and produces massless moduli fields in the effective the- 
' ory. One way to introduce a potential for these moduli and 
' thereby fix their values is the use of background fluxes (for 
, a recent review in the context of intersecting brane models 
' see [5]), but even after using this method we are left with 
I an abundance of possibilities. 

Facing such a huge number of possible low energy the- 
ories, we have to answer the question why exactly the 
(supersymmetric) standard model is realised in nature. 
One certainly has to try to identify a selection mechanism 
within string theory that singles out a particular solution 
(e.g. based on entropy) or one has to face the possibihty 
of retreating to anthropic reasoning. 

However, it might be possible to extract some useful in- 
formation from a statistical analysis of solutions by search- 
ing for common properties within ensembles of models at 
different points in the landscape. This might give valu- 
able hints for model building by excluding or highlighting 
regions of the parameter space. 

Moreover, looking for correlations between low energy 
observables might be an interesting possibility [51I1]. If 
found, these correlations could not only teach us some 
lessons about the general behaviour of string theory mod- 
els and help to identify underlying principles, but they 

Email: fgmeiner@nikhef.nl 



might even lead to concrete results. If it should turn out 
that certain correlations exist in a wide variety of mod- 
els, one could conjecture them to be true in general and 
thereby obtaining testable predictions for experiment. 

Besides these interesting, desired correlations, we have 
to take a different type of unwanted correlations into ac- 
count as well [5] . In the analysis of ensembles of models 
it can be desirable or even necessary to infer from the 
properties of a particular subset of models to the distri- 
bution of these features in the whole class. In the simplest 
case these subsets are randomly chosen, but most of the 
time one has no other choice then to introduce a bias on 
the basis of which models are feasible to calculate. In this 
case one has to be very careful not to run into unwanted 
correlations. Concretely, the expectation value of the sta- 
tistical distribution might explicitly depend on the choice 
function. 

How should one proceed to obtain statistical data on 
string compactifications? There are basically two routes 
one could follow. One of them relies on a true statistical 
approach, using an approximated measure for the space 
of models (6j. The other one uses ensembles of explicitly 
calculated models at specific points in the landscape to 
compute frequency distributions of their properties, which 
eventually can be extrapolated to a wider class of com- 
pactifications. This is the approach followed here, in par- 
ticular we will consider frequency distributions of proper- 
ties of intersecting brane models [3,7,8,9, lOj. Other cor- 
ners of the landscape that have been studied using a sim- 
ilar method include Gepner models pT|[T2] , which we will 
use later to compare results on correlations, and orbifold 
compactifications of heterotic string theory [T3] . 
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2 Intersecting brane models 

We will work with simple orientifold models of type IIA, 
compactified on toroidal orbifold backgrounds and equipped 
with D6-branes at angles [14]. These models have been 
studied in great detail over the last years and are still be- 
ing used for phenomenological model building 15J. How- 
ever, to be able to study large quantities of different con- 
structions (or even all possible models in a given back- 
ground) , some important points will not be taken into ac- 
count. One of them is the question of moduli stabilisation, 
in particular the inclusion of background fluxes, another 
one are contributions of non-perturbative effects, such as 
instanton corrections. We believe however, that the fre- 
quency distributions obtained in this simplified setup can 
be used as a basis for a refined analysis and most of their 
properties are not going to change qualitatively. 

Specifically, the backgrounds we consider are of the 
form 

R^-^xM, M = T^/G, G e {Z2xZ2,Z6,Z^}, 

where Zg and Zg denote two different embeddings of the 
group action into the torus-lattice. The orientifold pro- 
jection consists of dividing out worldsheet parity, accom- 
panied by a reflection along three of the torus axes in 
space-time. This introduces topological defects, which are 
described by orientifold 06-planes that carry tension and 
are charged under the RR-seven-form. 

To account for the tadpoles introduced by the 06- 
planes, one introduces D6-branes in the background, which 
are space-time filling and wrap, as the orientifold planes, 
Lagrangian three-cycles in the compact space. These cy- 
cles can be parametrised using a basis of H3 (M, Z) . It can 
be split into one part that comes from those torus cycles 
that survive the projections (called bulk cycles in the fol- 
lowing) and another part that is due to the existence of 
exceptional cycles at the fixed points of the orbifold ac- 
tion, which are related to the possible blow-up modes. 

For the three geometries under consideration the situ- 
ation is rather different. In the case of T^/(Z2xZ2), we do 
not have exceptional cycles, such that the basis consists of 
torus cycles only. In the other two cases exceptional cycles 
do exist and combine with the torus cycles to fractional 
cycles. The existence of exceptional cycles makes a huge 
difference for the statistics, as we will see below. 

Not every possible background will lead to a valid 
model. In order to obtain consistent compactifications, 
there are several consistency conditions that have to be 
fulfilled. Two of them have already been alluded to, namely 
the cancellation of tadpoles coming from the RR-seven- 
form, which reads in homology 

Y,NaiT^a+K) = L7ro6, (1) 
a 

where tt are the three-cycles wrapped by the brane a, it's 
orientifold mirror a' and the 06-planes. L denotes the 
orientifold charge and the sum runs over all stacks of D6- 
branes in the model, each consisting of Na branes. For a 



background with third Betti number 63 there are 63/2 such 
conditions. Moreover, since we are looking for supersym- 
metric models in four dimensions, which leads to the con- 
straint that all three-cycles have to be special Lagrangian, 
we have to impose the condition that the calibration form 
vanishes when restricted to the three-cycle. Addition- 
ally anti-branes should be excluded from the spectrum, 
leading to the condition that the real part of has to be 
positive, 

lmf23\^ = 0, Rel?3 > 0. (2) 

One last constraint comes from K-theory and can be for- 
mulated in our setup as a condition on intersections be- 
tween all brane stacks and some orientifold invariant probe- 
branes, 

NaTTa ° TTprobc = mod 2. (3) 

a 

More details on the different geometries, consistency 
conditions and brane embeddings can be found in the re- 
spective papers [51[^[TD]. 

Each stack of branes carries a gauge group G(N) 
on it's worldvolume, where G € {U,Sp,SO}, depend- 
ing on whether the three-cycle it wraps coincides with 
the orientifold plane. Matter arises at the intersection of 
brane stacks and their orientifold mirrors. One can distin- 
guish between chiral and non-chiral multiplets and com- 
pute their multiplicities in terms of the intersection num- 
bers between the relevant cycles. For two stacks a and b 
with Na and A^;, branes we obtain chiral matter in the 
bifundamental representation (Na, Nb) with multiplicity 
lab — T^a^T^b and in the representation (Na, Nb) with mul- 
tiplicity lab' — T^a ° T^'ii- In 3- similar manner one can com- 
pute the non-chiral multiplets [9]. Moreover, each brane 
might contribute matter in the adjoint, symmetric and 
antisymmetric representations of the gauge group G{Na). 



3 Statistical distributions 

To evaluate the models described in the last section statis- 
tically, large ensembles of solutions to the constraints ^ , 
and ([3]) can be generated using computer algorithms, 
thereby making use of the fact that after the introduction 
of a suitable basis one can express everything in terms of 
integer valued algebraic equations. 

A complication that arises at this point is that the 
problem of classifying all possible solutions to the system 
of equations is NP-complete (16j . However, the number 
of solutions can be shown to be finite, so it depends on 
particular problem whether it is possible to find all possi- 
ble solutions within reasonable timescales. In the case of 
Zg this is indeed possible, because the set of bulk cycles 
is very restricted. For the other two backgrounds under 
consideration an explicit construction of the full space of 
solutions cannot be achieved, which makes it necessary to 
impose a restriction to a subset of models. From the prop- 
erties of the subsets one can then deduce the frequency 
distributions for the full set of solutions. 
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Fig. 1. Distributions of the probability to find a single gauge group factor of rank in the full ensemble of intersecting brane 
models on jil.iY.'LT) (left), T^Ze (middle) and tV^s (right). 



The subsets have to be chosen in such a way that no 
unwanted bias is introduced that would distort the sta- 
tistical distributions. In the case of T^/(Z2xZ2) we have 
used a cut-ofF on the space of the real part of the complex 
structure which is one of the free parameters. In this way 
some interesting models are not captured by the analysis, 
but the resulting set of solutions is large enough to obtain 
valid statistical distributionfQ. For the T^/Zg background 
a different method has been used, namely a restriction to 
several randomly generated subsets of models, that have 
been tested afterwards to make sure that they do not suf- 
fer from unwanted correlations. 

The total number of solutions is quite different for the 
three cases at hand. For TV(Z2 xZa) there are 0(10^°) 
models and in the caseof T^^e and T'^ jU^ we find ©(lO^**) 
and 0(10^^) solutions, respectively. The huge differences, 
in particular between the first and the latter two back- 
grounds, is due to the effect of exceptional branes. One 
can show that the tadpole constraints ([1]) split into a bulk 
part and an exceptional part, such that one can treat the 
two sets of cycles independently. Each bulk brane can be 



combined with one of 
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2' different possible excep- 



tional branes to form a fractional cycle, but not all of them 
fulfil the consistency conditions, such that the number of 
possibilities is reduced. For a model with k stacks this 
amounts to n\ combinations. Not all of these lead to con- 
sistent models, but the restrictions from the exceptional 
part of the tadpole equations are only polynomial, leading 
to an exponential enhancement of the space of solutions. 

As an example for the frequency distribution of gauge 
sector properties, we will take the probability to find a 
semi-simple gauge factor of rank TV within one model. 
Combining the probability of several gauge factors, one 
can estimate the frequency of certain gauge groups, such 
as the one of the standard model, for example. As one 
can see clearly from the distributions for the three ge- 
ometries (Fig. [U, the r6/(Z2xZ2) orbifold differs quite 



^ If one restricts the survey to models with certain properties, 
e.g. specific gauge groups, or fixes the number of brane stacks, 
statements about the full set of solutions are possible [8]. 



dramatically from the other two geometries. This is again 
an effect of the contribution of exceptional cycles, which 
can be quantified in this case. 

For T^/(Z2 XZ2) the distribution for an ensemble of 
models with given number of stacks is proportional to 
L^/iV^ [8]. Including a sum over all possible stack sizes 
fc and, in the case of models with exceptional stacks, the 
exponential enhancement factor n(fc), all three distribu- 
tions can be approximated by 



L + l-AT 



fc=l 



^ k 



(4) 



where ne = 1 for models without exceptional cycles. In the 
case of r6/(Z2xZ2) we obtain {L + 1)L^ /N"^ -L'^ /N , while 



for the two embeddings of Zg we find L n, 



4 (T-l-l-AT) 



/iV2 



4 Correlations 

As mentioned in the introduction, the most promising 
route to obtain results that may give rise to testable pre- 
dictions is to look for correlations between low energy ob- 
servables. 

In the following we will consider only one simple exam- 
ple of correlations between properties of the gauge sector 
to show how this might be done in practise, but certainly 
many more possibilities could be considereqfl. 

Within the ensemble of models described above, for 
each pair of branes a and 6, we always obtain a pair of 
chiral matter in the bifundamental representation of the 
two gauge group factors, coming from the intersection of 
the two branes and the intersection of one brane with the 
orientifold mirror of the other one. One can define the two 
quantities 



^ — \Iab ± lab 



(5) 



^ This section contains some preliminary results of work in 
progress with Tim Dijkstra. 
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Fig. 2. Normalised frequency distribution of intersection numbers A^, as defined by ([5]), for Gepner models (left) and intersect- 
ing brane models on r®/(Z2xZ2) (middle) and T^/Ze (right). The small inserts show the pattern that emerges from randomly 
generated intersections using the same set of branes. 



invariant under the exchange of branes and the orientifold 
action, that describe the net amount of chiral matter for 
one particular brane. 

We will use these two quantities as an example of how 
a correlation can arise in the construction, and therefore 
in the amount of chiral matter that shows up in the ef- 
fective theory. To show that this effect does not depend 
on the specific geometry, we compare the analysis of the 
correlation pattern between intersecting brane models on 
T^/ (Z2 XZ2) and T^/Ze with a similar analysis of Gepner 
model constructions [l^ . To see how far the actual distri- 
bution diverges from a generic match of branes, we use a 
distribution with randomly generated pairings within the 
set of branes of the model under consideration. 

Although the distributions (Fig. [J) are quite differ- 
ent quantitatively, at a qualitative level the distribution 
of intersection numbers is very similar for the intersect- 
ing brane models and the Gepner model constructions. In 
particular the tendency towards either identical or rather 
distant values for is common in all three distributions. 
This is quite remarkable, since one has to keep in mind 
that the Gepner models are not only located at a different 
point in moduli space, but the ensemble considered here 
consists of a sample of several thousand different Gep- 
ner models, all corresponding to different backgrounds, of 
which only very few have even a geometrical interpreta- 
tion. 

The fact that the distribution for r^/(Z2xZ2) looks 
a bit blurred, can be traced back to the fact that the en- 
semble under consideration has been cut off at high values 
of the complex structure modulus, as explained above. In 
the case of T^/Zg we are considering a random subset of 
models that include also exceptional branes, which makes 
the ensemble exponentially larger and at the same time 
reduces the number of possible values for intersections [9j . 
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